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Abstract 

O . 

The problem of joint source-channel coding in transmitting independent sources over interference channels 
with correlated receiver side information is studied. When each receiver has side information correlated with 

> 

■ its own desired source, it is shown that source-channel code separation is optimal. When each receiver has side 

I information correlated with the interfering source, sufficient conditions for reliable transmission are provided 

(N ■ based on a joint source-channel coding scheme using the superposition encoding and partial decoding idea of 

Han and Kobayashi. When the receiver side information is a deterministic function of the interfering source, 
source-channel code separation is again shown to be optimal. As a special case, for a class of Z-interference 
channels, when the side information of the receiver facing interference is a deterministic function of the 
interfering source, necessary and sufficient conditions for reliable transmission are provided in the form of 



X 

^ . single letter expressions. As a byproduct of these joint source-channel coding results, the capacity region of a 

class of Z-channels with degraded message sets is also provided. 
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I. Introduction 

The wireless medium is shared by multiple cormnunication systems operating simultaneously, which 
leads to interference among users transmitting over the same frequency band. In the simple scenario 
of two transmitter-receiver pairs, the interference channel [1] models two simultaneous transmissions 
interfering with each other. In the classical interference channel model, the sources intended for 
each receiver are independent of each other, and the receivers decode based only on their own 
received signals. On the other hand, in applications such as sensor networks, it is reasonable to 
assume that the receivers have access to their own correlated observations about the underlying source 
sequences as well. These correlated observations at the receivers can be exploited to improve the 
system performance. 

Even in the absence of side information, a finite letter expression for the capacity region of an inter- 
ference channel in the general case is unknown. We have the capacity region in the case of interference 
channels with statistically equivalent outputs [2]-[4], discrete additive degraded interference channels 
[5], a class of deterministic interference channels [6], strong interference channels [7]-[ll], a class 
of degraded interference channels [12], and more recently for a class of Z-interference channels [13]. 
The best known achievable rate region is due to Han and Kobayashi [9], a simplification of which is 
given in [14]. 

In a point-to-point scenario, the availability of correlated side information at the receiver is con- 
sidered in [15]. It is shown that the source-channel separation theorem applies in this simple setting 
and, moreover, that Slepian-Wolf source coding followed by channel coding is optimal. With the 
availability of side information at the receiver, we can transmit the source reliably over a channel 
with smaller capacity than the one required when there is no receiver side information. However, it 
is known that the source-channel separation theorem does not generalize to multi-user channels [1], 
[16], and necessary and sufficient conditions for reliable transmission in the case of correlated sources 
and correlated receiver side information are not known in general. In [17], necessary and sufficient 
conditions are characterized for broadcasting a common source to multiple receivers with different 
correlated side information. An alternative achievability scheme for the setup of [17] is given in [18]. 
In [19], the results of [17] are extended to broadcast channels with degraded message sets in which 
the receivers have access to parts of the underlying messages. Availability of messages or message 
parts at the receivers of broadcast channels from the channel coding perspective is studied also in 
[20]-[22]. In [23], broadcasting a pair of correlated sources with correlated receiver side information 
is studied. 
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The interference channel with correlated sources is considered in [24], and a sufficient condition 
for reliable transmission is given. In [25], an interference channel with independent sources, in which 
each receiver has access to side information correlated with the interfering transmitter's source, is 
considered. Necessary and sufficient conditions for this setup are characterized under the strong 
source-channel interference conditions, which generalize the usual strong interference conditions by 
considering correlated side information as well. The result of [25] shows that interference cancellation 
is optimal even when the underlying channel interference is not strong, as long as the overall source- 
channel interference is. 

In this paper, we extend the scenario studied in [25] to more general interference channels. We first 
consider the case in which each receiver has side information correlated with the source sequence it 
wants to decode. We prove the optimality of source-channel code separation in this situation; that is, 
the optimal performance can be achieved by first compressing each of the sources using Slepian-Wolf 
coding with respect to the correlated receiver side information, and then transmitting the compressed 
bits over the channel using an optimal interference channel code. 

Next, we consider the scenario in which each receiver has side information correlated with the 
interfering transmitter's source. As an example of such a model and to illustrate the benefits of side 
information about the interfering source, consider the extreme case in which each receiver has access 
to the message of the interfering transmitter. Note that this setup is equivalent to the restricted two- 
way channel model of Shannon, whose capacity is characterized in [1]. In this case, each receiver 
can excise the interference from the undesired transmitter, since its message is exactly known at the 
receiver. Here, we consider the more general case of arbitrary correlation between the receiver side 
information and the interfering source, and propose a joint source-channel coding scheme similar to 
that of Han and Kobayashi [9] taking the side information into account. Later, we consider the case in 
which the side information is a deterministic function of the interfering source, and show that source- 
channel code separation is again optimal. Finally, we consider a special class of interference channels 
called Z-interference channels, in which only one receiver faces interference. Further focusing on a 
special class of Z-interference channels satisfying certain conditions (which will be stated later), and 
the case in which the side information is a deterministic function of the interfering source, we are able 
to characterize necessary and sufficient conditions for reliable transmission in the form of single letter 
expressions. This setting also constitutes an example for which the general sufficiency conditions we 
provide are also necessary, proving their tightness for certain special cases. 

The rest of the paper is organized as follows. In Section |II] we present the system model. In Section 
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Hill we prove the optimality of source-channel code separation when the side information is correlated 
with the desired source. The case in which the side information is correlated with the interfering 
source is considered in Section |IVl In Section IIV-AI we provide sufficient conditions for reliable 
transmission, while in Section ITV-B I we prove the optimality of source-channel code separation when 
the side information is a deterministic function of the interfering source. In Section ITV-CI we show that, 
for a special source and channel model, the sufficient conditions for reliable transmission proposed in 
Section IIV-AI are also necessary, and hence we give a single letter characterization of the necessary 
and sufficient conditions for this model. In Section |V] we characterize the capacity region of a class 
of Z-channels with degraded message sets. This is followed by conclusions in Section |Vll 

II. System Model 

An interference channel is composed of two transmitter-receiver pairs. The underlying discrete 
memoryless channel is characterized by the transition probability p{yi,y2\xi, X2) from finite input 
alphabet Xi x X2 to finite output alphabet x 3^2- Transmitter k has access to the source sequence 
{Uk,i}'iZi, k = 1,2. Consider side information sequences {Vk^i}°Zi, where the source and the side 
information sequences are independent and identically distributed (i.i.d.) and are drawn according to 
joint distribution p{ui, Vi)p{u2, f 2) over a finite alphabet Wi x Vi x W2 x V2; that is, the two source-side 
information pairs are independent of each other. 

For k = 1,2, Transmitter k observes UJ! and wishes to transmit it noiselessly to Receiver k over n 
uses of the channel]. The encoding function at Transmitter k is 

rn J jn -yn 
Ik • ^fc ^ ^k- 

We assume that the side information V^^^^ is available at receiver k, where 7r(-) is a permutation 
of {1,2}. Depending on the scenario, we will specify whether the side information is correlated with 
the desired source or with the interfering source. 

The decoding function at receiver k reconstructs its estimate Uk from its channel output and side 
information vector using the decoding function 

9k ■■ yk X v:(,) 

'Here we use the notation UJ^ — {Uk,i, ■ ■ ■ , Uk,n), and similar notation for otlier length-n sequences. 
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The probability of error for this system is defined as 

p: = vvm,u-)^{ij-,u-)}, 

p«,o^{(f>r,f>2") ^ K,o|(f/r,t/2") = K,o} • 

Definition 1: We say that a source pair (f/i, f/2) can be reliably transmitted over a given interference 
channel if there exist a sequence of encoders and decoders (/", /2 , S'l , ) such that as 

n ^ 00. 

In the following sections, we consider two cases in particular. In the first case, each receiver has 
side information correlated with its desired source, i.e., 7r(A;) = A;, /c = 1, 2. In the second case, each 
receiver has side information correlated with the interfering source, i.e., 7r(l) = 2 and 7r(2) = 1. 
In both cases, we want to exploit the availability of correlated side information at the receivers. In 
the first case, each transmitter needs to transmit less information to its intended receiver due to the 
availability of correlated side information. In the latter case, the side information is used to mitigate 
the effects of interference. 

For notational convenience, we drop the subscripts on probability distributions unless the arguments 
of the distributions are not lower case versions of the corresponding random variables. 

III. Side information correlated with the desired source 

In this section, we consider an interference channel in which each receiver has side information 
correlated with the source it wants to decode, i.e., receiver k has access to side information Vk (see 
Fig. [T]). For this special case, we prove that the source-channel separation theorem applies; that is, 
it is optimal for the transmitters first to apply Slepian-Wolf source coding to compress their sources 
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Fig. 2. Interference channel model in which the receivers have access to side information correlated with the source of the interfering 
transmitter. 

conditioned on the side information at the corresponding receiver, and then to transmit the compressed 
bits over the channel using an optimal interference channel code. Note that, in the general case, we do 
not have a single-letter characterization of the capacity region of the interference channel, yet we can 
still prove the optimality of source-channel code separation. In the proof, we use the n-letter expression 
for the capacity region, which was also used in [26] to prove the optimality of source-channel code 
separation for a multiple access channel with receiver side information and feedback. The main result 
of this section is the following theorem. 

Theorem 1: Sources Ui and U2 can be transmitted reliably to their respective receivers over the 
discrete memoryless interference channel p{yi,y2\xi, X2) with side information Vk at receiver k, k = 
1,2, if 

{H{Ui\Vi),H{U2\V2))etnt{C) (1) 

where int(-) denotes the interior, and C denotes the capacity region of the underlying interference 
channel. 

Conversely, if {H (Ui\Vi) , H (U2\V2)) ^ C, then sources Ui and U2 cannot be transmitted reliably. 
Proof: A proof of Theorem [T] is given in Appendix HI ■ 

IV. Side information correlated vv^ith the interfering source 

In this section we consider the case in which Receiver 1 has access to V2 while Receiver 2 has 
access to Vi, i.e., each receiver has side information about the interfering transmitter's source (see 
Fig. O. We investigate how the side information about the interference helps in decoding the desired 
information. 
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A. Sufficient Conditions for Reliable Transmission 

We first provide sufficient conditions for reliable transmission of the sources. In the spirit of the 
Han-Kobayashi scheme for the classical interference channel, we propose a joint source-channel coding 
scheme that requires the receivers to decode part of the interference with the help of their side 
information. In the Han-Kobayashi scheme, each transmitter splits its message into two pieces to 
allow the non-intended receiver to decode part of the interference. In our scheme, each transmitter 
enables a quantized version of its source to be decoded by both receivers, where the unintended 
receiver uses its correlated side information as well as the channel output to decode the interference 
corresponding to this quantized part. Sufficient conditions for reliable transmission in this setup are 
given in the following theorem. 

Theorem 2: Sources Ui and U2 can be transmitted reliably over the interference channel p{yi, y2\xi,X2) 
with side information Vi at Receiver 2 and V2 at Receiver 1 if there exist random variables Wi and 
W2 such that 



H{U,) 


<I{X^-V2,Y^\W2,Q), 


(2) 


H{U2) 


<I{X2]V^,Y2\W^,Q), 


(3) 


H{U,) 


<I{W2, Xi; V2, FilQ) - /(f/2; W2\Q). 


(4) 


H{U2) 


<I{W^, X2, V^, Y2\Q) - I{U,; Wi\Q), 


(5) 


H{Ur)+H{U2) 


<7(Xi; V2, Yi\Wi, W2, Q) + /(M^i, X2; 14, Y2\Q), 


(6) 


H{Ui)+H{U2) 


<I{X2; V,, Y2\Wi, W2, Q) + I{W2, X,; V2, Y,\Q), 


(7) 


HiU,)+H{U2) 


<IiW,, X2; V,, Y2\W2, Q) + I{W2, X,; V2, Y,\W^, Q), 


(8) 


H{U,)+H{U2) 


<I{W2, X,; V2, Y,\Q) + I{W,, X2; Vu Y2\W2, Q) - I{U,; W,\Q), 


(9) 


H{U^)+H{U2) 


<IiW^, X2; 1/1, Y2\Q) + IiW2, X,; V2, Y,\Wi, Q) - I{U2, W^slQ), 


(10) 


2H{U^) + H{U2) 


<I{W2, Xi; V2, FilQ) + /(Xi; V2, Y^\Wu W2, Q) + liW^ X2; Vi, Y2\W2, Q), (U) 


H{Ur) + 2H{U2) 


<I{W^, X2; V,, Y2\Q) + /(X2; V,, Y2\W,, W2, Q) + I{W2, Xi; V2, Yi|M^i, Q), (12) 



for some p{q), p{wi, xi\ui, q), and p{w2, X2\u2, q), where the entropies and mutual information terms 
are evaluated using the joint distribution 



p{q, Ui, Vi, U2, V2, Wi, W2, Xi, X2, Vl, 1)2) = P{q)p{U]_, Vi)p{u2, V2)p{wi, Xi\ui, q)p{w2, X2\U2, q) 

p{yi,y2\xi,x2). (13) 
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Proof: A proof of Theorem [21 is given in Appendix [III ■ 
We remark here that the achievability scheme in the proof of Theorem [2] uses joint source-channel 
coding and hence, similarly to [16] and [27], the expressions involve joint distribution of the source 
and channel variables, which potentially increases the achievable rate region by enlarging the set 
of possible joint distributions. Below in Corollary [H we provide a sufficient condition for reliable 
transmission based on separate source and channel codes in the spirit of "operational separation" as 
in [17], [25], which can be obtained as a special case of Theorem |2l Note that operational separation 
is different from the classical ("informational") separation, in which each source is first assigned to 
an index and then these indices are transmitted using an optimal channel code for the underlying 
channel. Operational separation corresponds to separation of the source and the channel variables as 
in Corollary [T] without using the optimal source or the channel codes (see [25] for further details and 
examples). 

Corollary 1: Sources Ui and U2 can be transmitted reliably over the interference channel p{yi, y2\xi,X2) 
with side information Vi at Receiver 2 and V2 at Receiver 1 if there exist random variables W^i, Wi 
and W2, W2 such that 

H{U,) <I{X,;Y,\W2,Q), (14) 

H{U,) + I{W2;U2\V2,Q) <I{X,,W2;Y,\Q), (15) 

H{U2) <I{X2;Y2\W,,Q), (16) 

H{U2) + I{W,;U,\VuQ) <I{X2,W,;Y2\Q), (17) 

H{Ui) + H{U2) - /(Wi; V^\Q) <I{X,; Y,\Wu W2, Q) + I{W,, X2; FslQ), (18) 

H{Ui) + H{U2) - /(W2; V2\Q) <I{X2; Y2\Wi, W2, Q) + /(W?2, Xr, Y^\Q), (19) 

H{Ui) + H{U2) - /(W^i; VSig) - /(W2; V2\Q) <I{W^, X2, Y2^2. Q) + Hm, X,; Y,\W,, Q), 

(20) 

H{U,) + H{U2) + /(Wi; U,\Vu Q) - /(W2; l^alQ) </(W^2, X^; Y^\Q) + X2; Y2\W2, Q), (21) 

H{Ui) + H{U2) + /(W2; U2\V2, Q) - /(Wi; Vi\Q) <I{Wi, X2; FalQ) + H^, X^, Yi\Wi, Q), (22) 
2H{Ui) + H{U2) - /(Wi; Vi\Q) - /(W2; V2IQ) </(W^2, Xi; Y^\Q) + /(Xi; Y^\W^, W2, Q) 

+ /(W^i,X2;y2|W^2,Q) and (23) 
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HiUi) + 2H{U2) - liWi- Vi\Q) - I{W2; V^slQ) <I{Wi, X^; Y2\Q) + /(X2; F2|1^2, W^i, Q) 

+ I{W2,X^-Y^\W^,Q), (24) 

for some p{q), p{wi\ui,q), p{jv2\u2,q), p{wi,xi\q) and p(w2,X2|q'), where the entropies and mutual 
information terms are evaluated using joint distribution 

p(g, Ml, vi, U2, V2, Wi,W2, wi, W2, Xi, X2, Vi, 1/2) = p{q)p{ui, vi)p{wi q)p{u2, V2) 

p{w2\u2, q)piwi, Xi\q)p{w2, X2\q)p{yi, y2\xi,X2). 

_ (25) 

Proof: Corollary [H follows directly from Theorem |2l by letting Wk = {Wk,Wk) and fixing the 
distributions as p{wk,Xk\uk,q) = p(wk\uk, q)piwk, Xk\q), for k = 1,2. U 
The sufficient conditions in Corollary [T] are looser than those in Theorem |2l However, it is not clear 
whether they are strictly looser. 

Remark 1: In the special case of no receiver side information, i.e., Vi = V2 = 0, by fixing Wi = 
W2 = 0, and defining Ri = H(Ui) and R2 = H{U2), the sufficiency conditions in Corollary [T] boils 
down to the Han-Kobayashi rate region in the form expressed in [14, Theorem 2]. 

We do not know whether the sufficient conditions for reliable transmission provided in Theorem [2] 
are too strong, leading to pessimistic results in general. However, in Section IIV-C[ we show that for 
some special cases, the sufficient conditions obtained through separate source and channel coding in 
Corollary [U are also necessary, which shows that at least for certain special cases. Theorem [2] is tight. 

B. Deterministic Side Information 

In this subsection, we focus on the special case in which the side information sequences Vi and V2 
are deterministic functions of the sources Ui and U2, respectively, i.e., 

Vk^, = hu{Uk,^). A; = 1,2, ^ = 1,2,--- (26) 

for some deterministic functions hi and /i2, or equivalently we have H{yk\Uk) = for k = 1,2. 

The main result of this subsection is that when the side information is a deterministic function 
of the interfering source, the source-channel separation theorem applies; that is, it is optimal to first 
perform source coding and encode into message Wks, and the remaining part of UJ!, denoted by 
^fc 1^"' i'^to message Wkp, k = 1,2, and then to transmit these messages optimally over the underlying 
interference channel p{yi,y2\xi, X2) with side information Wis at Receiver 2, and side information 
W2S at Receiver 1. 
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Fig. 3. Interference channel with message side information at the receivers. 



First, we define the capacity region of the interference channel with message side information at 
the receivers (see Fig. [3]). In this communication scenario, Transmitter k has two messages Wks and 
Wkp, of rates R^s and R^p respectively, to transmit with negligible probability of error to Receiver k, 
k = 1,2, while Receiver 2 has access to Wis, and Receiver 1 has access to W2s- AH messages are 
independent. A (2"-^^% 2"^ip, 2"-^'^% 2"^^?^ code for this channel consists of two encoding functions, 

:{1, 2, ■ ■ ■ , 2"^!^} X {1, 2, ■ ■ ■ , 2"^!''} ^ X^" (27) 

and 

:{1, 2, ■ ■ ■ , 2"«-} X {1, 2, ■ ■ ■ , 2"^^-} ^ (28) 

and two decoding functions 

g"^ -.y^ X {1, 2, ■ ■ ■ , 2"^^=} ^ {1, 2, ■ ■ ■ , 2"^!^} x {1, 2, ■ ■ ■ , 2"^i''} (29) 

and 

-.y^ X {1, 2, ■ ■ ■ , 2"-f^i=} {1, 2, ■ ■ ■ , 2"^^^} x {1, 2, ■ ■ ■ , 2"^^^}. (30) 
The average probability of error for the (2"'^^o ^ 2"'^^p , 2"'^^'' , 2"'^^p , nj code is defined as 

2nR-^^g 2^^1p 2"-^2s 2^"^2p 
■(ills=l Ulip = l t02s = l "'2p = l 

or g^{Y^,wu) ^ (^2^, W2p)|(wi., ^ip, ^2., ^2^) is sent}. (31) 

Definition 2: A rate quadruplet (-Ri^, -Rip, -^2^5 -R2p) is said to be achievable if there exists a sequence 
of (2"^i% 2"'^if , 2'^^2s 2"'^2p codes for which P^" ^ as n — > 00. The capacity region is defined 
as the closure of the set of achievable rate quadruplets {Ris, Rip, R2S, Rip), and is denoted by C/. 
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In order to show the optimality of source-channel code separation, similarly to Theorem [H we will 
use the n-letter characterization of Cj provided in the next lemma. Define as 

= I {Ris, Rip, R2S, R2p) -Rip < ^ri'S'is) S2s), Ris + Rip < —H-^i'^ '^r\^2s)y 

I n n 

R2p < ^2"l'S'ls5 S2s), R2S + R2p < ^-^("^2 ; ^2\Sls)y 

for any 1 (32) 



Lemma 1: The capacity region of the interference channel with message side information Wu at 
Receiver 2, and message side information W2S at Receiver 1 is 

Ci= lim (33) 



ra— >oo 



where the limit of the region is as defined in [1, Theorem 5]. 

Proof: A proof of Lemma \T\ is given in Appendix |llll ■ 
Now that we have the n-letter characterization of the capacity region of interference channels with 
message side information at the receivers, we are ready to show that the source-channel separation 
theorem holds when the receivers' side information sequences are deterministic functions of the 
interfering sources. 

Theorem 3: Sources Ui and U2 can be transmitted reliably to their respective receivers over the dis- 
crete memoryless interference channel p{yi,y2\xi, X2) with side information Vi = hi{Ui) at Receiver 
2, and side information V2 = /i2(t^2) at Receiver 1, if 

{H{Vi),H{Ui\Vi),H{V2),H{U2\V2)) G mt(C,), (34) 

where Cj denotes the capacity region of the interference channel with message side information at 
receivers. 

Conversely, if {H (Vi), H{U i\Vi), H{V2), H{U2\V2)) ^ Ci, then sources Ui and U2 cannot be trans- 
mitted reliably. 

Proof: A proof of Theorem [3] is given in Appendix |IVl ■ 
The benefits of considering the side information samples as deterministic functions of the source 
samples are two-fold. Firstly, the transmitters also know the side information and they can use this 
knowledge to minimize the amount of interference they cause. Due to this fact, we are able to achieve 
any point in the capacity region of the interference channel with message side information. Secondly, 
encoding the information of V^, A; = 1, 2 into the codebook at Transmitter k not only helps reduce the 

11 



interference at the other receiver, but also does not place any extra burden on Receiver k to decode 
14, as Vfc is a deterministic function of Uk- This fact enables the converse proof of the source-channel 
separation theorem. 

C. Necessary and Sufficient Conditions for Reliable Transmission for a Special Case 

In Section ITV-B I we have shown that source-channel separation is optimal when the side information 
is a deterministic function of the interfering source. Thus, for these cases, if the single-letter characteri- 
zation of the capacity region of the corresponding interference channel with message side information, 
i.e., Cj, is known, we would have necessary and sufficient conditions for reliable transmission in a 
single-letter form. However, a single-letter characterization of C/ is not known in general as it is a 
generalization of the capacity region of the classical interference channel. 

In this subsection, we consider the class of interference channels studied in [13]. We show that 
the Han-Kobayashi scheme is capacity-achieving for this class of interference channels [13] when the 
receivers have message side information, and we obtain a single-letter characterization of the capacity 
region. Hence, we conclude that, for this class of interference channels, when the side information is 
a deterministic function of the interfering source, the sufficient conditions provided in Theorem [21 are 
also necessary, yielding a single-letter characterization of the necessary and sufficient conditions for 
reliable transmission. This means that the achievability result presented in Theorem [2] is tight in some 
special cases. 

The special class of interference channels we focus on in this subsection is a class of Z-interference 
channels. For the Z-interference channels, p{yi, y2\xi, X2) can be written as p{y2\xi, X2) -p^yilxi), i.e., 
the channel between Xi and Yi is a single user channel characterized by p(y i\xi). This corresponds 
to an interference channel in which only the second transmitter-receiver pair faces interference. In 
particular, the members of the class of Z-interference channels we consider satisfy the following 
conditions: 

1) For any positive integer n, H{Y2\X2 = X2), when evaluated with the distribution ^^np(x^) 
p(l/2 Ix", X2), is independent of X2 for any 

2) Define r as 

T= max H{Y2). (35) 

p{xi)p{x2) 

Then there exists a p*{x2) such that H{Y2), when evaluated with the distribution Yl,xi X2^'(^i) 
p*{x2)p{y2\xi.,X2), is equal to r for any p{xi). 
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Please refer to [13] for intuition behind these conditions and examples of Z-interference channels that 
satisfy these two conditions. 

In the next lemma, we provide a single-letter characterization of C/, i.e., the capacity region of 
this class of Z-interference channels with message side information. Since Receiver 1 does not face 
interference, there is no benefit to having access to the side information W2s- Hence, without loss of 
generality, we assume i?2s = 0. 

Lemma 2: The capacity region of Z-interference channels satisfying Conditions 1 and 2, with 
message side information Wis at Receiver 2, is characterized by 

Rip + Ris<I{Xi;Yi), (36) 
R2p<I{W,X2]Y2) Mid (37) 
Rip + R2p < /(Xi; Yi\W) + I{W, X2; ¥2) (38) 

for some p{w)p{xi\w), where the mutual informations and entropies are evaluated with the joint 
distribution of the form 

p(w, xi, X2, yi, 2/2) = p{w)p{xi\w)p*{x2)p{yi\xi)p{y2\xi, X2). 
Proof: A proof of Lemma [2] is given in Appendix |Vl ■ 
The proof of Lemma [2] indicates that superposition encoding and partial decoding is capacity- 
achieving. More specifically, the codebook at Transmitter 1 is such that the inner codebook carries 
the side information at Receiver 2, i.e., Wu, and part of Wip, and the outer codebook carries the 
remaining part of Wip. 

Comparing these results in the case of side information at the receiver with the traditional Z- 
interference channel [13], the rate of Wip takes the place of Wi, which means that the message that 
causes interference is reduced from Wi to Wip. Due to the fact that Wis is available at Receiver 2, 
Wis does not cause any interference and therefore its rate can be made as large as possible within the 
constraint of the capacity of the channel p{yi\xi) depicted by (l36l) . 

Having established the capacity region of this special class of Z-interference channels with message 
side information at the receiver, we next consider the joint source-channel coding problem for this 
channel model with the assumption that each side information sample Vi^i is a deterministic function 
of the corresponding source sample f/i ,, i.e., Vi^i = for i = 1,2, ■ ■ ■ for some deterministic 

function hi. Since the first transmitter-receiver pair is interference-free, without loss of generality, we 
assume V2 = 0. 
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Since source-channel separation is shown to be optimal in Theorem [3] for the source and side 
information structure under consideration, we are able to characterize necessary and sufficient condi- 
tions for the reliable transmission of the sources in the single-letter form using the capacity region 
characterization given in Lemma [U 

Corollary 2: For Z-interference channels satisfying Conditions 1 and 2, and side information Vi = 
hi{Ui) at Receiver 2, necessary and sufficient conditions for reliable transmission are 

H{U^) <I{X,;Y,) (39) 
HiU2) <IiW,X2;Y2) and (40) 
H{U,\V,) + H{U2) <I{W, X2; Y2) + /(Xi; Y^\W) (41) 

for some p{w)p{xi\w), where the mutual informations and entropies are evaluated with p{ui,vi,U2, 
w, xi, X2, yi,y2) = p{ui,vi)p{u2)p{w)p{xi\w)p*{x2)p{yi\xi)p{y2\xi,X2). 

Proof: Corollary [2] follows directly from combining Theorem [3] and Lemma [2l ■ 

In Corollary [B specify V2 = 0, choose W2 = W2 = 0, Wi = Vi, Q = (/} and p{x2) = p*{x2). 
Renaming Wi as Wi and using Condition 2 and the fact that H{Ui) — H{Vi) = H{Ui\Vi), we. obtain a 
sufficient condition which is the same as the necessary and sufficient condition specified in Corollary 
|2l Hence, we conclude that in this special case, the sufficient conditions described in Corollary [T] based 
on separate source and channel coding are also necessary. This shows that the conditions presented 
in Theorem [2] are also necessary at least in certain scenarios. 

Corollary [2] shows how the side information Vi = hi{Ui) about the interference Ui helps in reliable 
transmission, and determines the most efficient way of using this side information: Transmitter 1 
performs a separation-based encoding scheme. It first splits its source t/" into and a remaining 
part using entropy-achieving data compression techniques, and thus obtains two messages Wy^ and 
Wu^ivi'- Then, it further splits message Wu^ivi" into two parts Whinner and PKuter, at rates 7 and 
H{Ui\Vi) — 7, respectively. Next, it performs superposition encoding, transmitting Wy^ and Whinner 
through the inner code at rate -f^(Vi) + 7, and VFouter through the outer code at rate H{Ui\Vi) — 7. 
Transmitter 2 performs separation-based source-channel coding, first mapping U2 into a message 
W2 and then mapping W2 into a codeword of an i.i.d. codebook generated with distribution p*{x2). 
Receiver 1 decodes both the inner and the outer codes. Receiver 2 knows the side information and 
hence sees an inner codebook at an effective rate of 7 only. It decodes the inner codeword and the 
codeword of Transmitter 2 jointly using the received signal and the available side information about 
the interference. 
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The intuition obtained from the special case derived in this subsection is that one should put as much 
information as possible about the side information within the inner codebook, in order to minimize 
the impact of interference when the side information about the interference is available at the receiver. 

V. Z-Channel with Degraded Message Sets 

The result in (I36l)-(l38l) is directly related to the capacity region of the Z-channel with degraded 
message sets, based on the intuition gained from the proof of Theorem 3 in [19]. The intuition in 
[19] is that when the receiver has some side information about the undesired message, we can set up 
a new scenario in which the receiver does not have access to the side information, and is required to 
decode it. Then, when we remove the rate constraint associated with decoding of the side information 
at the receiver in the capacity region of the new scenario, we get the capacity results of the original 
scenario. Therefore, the solution given in (l36l)-(l38]) resembles the solution of the following problem. 

The channel is described by two transition probabilities p{yi\xi) and p{y2\xi, X2), and satisfies 
both Conditions 1 and 2. There are three independent messages Wic, Wip and Transmitter 1 has 
messages Wic and Wip and Transmitter 2 has message W2. Wic needs to be decoded at both receivers, 
while Wip and W2 need to be decoded only at Receiver 1 and Receiver 2, respectively. 

This channel model includes the Z-interference channel as a special case, when the rate of Wic is 
zero. Compared to the definition of the Z-channel in [28], Wic is not only intended for Receiver 2, but 
also for Receiver 1. Therefore, we call this channel model as the Z-channel with degraded message 
sets. 

Then the capacity region for the Z-channel satisfying Conditions 1 and 2, with degraded message 
sets can be characterized as follows: 

Rip<I{Xi;Yi\W)+j, (42) 
Ric + Rip<I{Xi;Yi), (43) 
Ric<I{W;Y2\X2)--f and (44) 
R2 + Ric<r- H{Y2\W, X2) - 7, (45) 

for some p{w)p{xi\w) and 7 > where the mutual informations and entropies are evaluated using 
p{w,xi,X2,yi,y2) = p{'w)p{xi\w)p*{x2)p{yi\xi)p{y2\xi,X2). The proof of this result follows from 
arguments very similar to those used in the scenario of message side information at the receiver 
considered in Lemma [2l 
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VI. Conclusions 



We have studied the problem of joint source-channel coding in interference channels with correlated 
receiver side information. In the case when the receiver side information is correlated with its desired 
source, we have shown that separate design of source and channel codes is optimal. In order to 
minimize the interference to the other transmitter-receiver pair, the transmitters should transmit only 
the part of their sources that is not already known by their corresponding receivers. 

For the case in which the receiver side information is correlated with the interfering source, we 
have provided sufficient conditions for reliable transmission by proposing a joint source-channel coding 
scheme based on the idea of superposition encoding and partial decoding of Han and Kobayashi. As 
a special case, we have focused on the scenario in which the side information at the receiver is a 
deterministic function of the interfering source, and we have shown that source-channel separation is 
optimal for this situation as well. In both cases for which the optimality of source-channel separation 
is established, we have used the n-letter expression for the capacity region as a single-letter expression 
is not available in general. 

Finally, for a class of Z-interference channels for which superposition encoding and partial decoding 
is optimal in the absence of receiver side information, when the receiver facing interference has 
access to a deterministic function of the interfering source, we have shown that the provided sufficient 
conditions are also necessary. Hence, the sufficient conditions are tight at least in some special cases. 



The achievability part of the proof is straightforward. If ([T]) holds, then there exists a rate pair 
R2) in the interior of C such that H(Uk\Vk) < Rk for k = 1,2. Each transmitter compresses its 
source with respect to the side information at its own receiver. This can be done at rate Rk due to the 
Slepian-Wolf theorem. Then the compressed bits can be transmitted reliably over the channel since 
(i?i,i?2) is in the capacity region of the underlying interference channel. 

To prove the converse, we first provide an infinite letter expression for the capacity region of the 
interference channel given in [2]. We define 



Appendix I 



Proof of Theorem [H 



n 




(46) 



Then 



C 



lim E, 



(47) 
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where the limit is defined as in [1, Theorem 5]. C is a closed convex set in the Euclidean plane. 
From Fano's inequality [29], we have, for A; = 1, 2, 

where 5{x) is a non-negative function approaching zero as x 0. 
Next, we write the following chain of inequalities: 

-/(Xf;yn > -liW-Y^) (49) 

n n 

= -J(f/r,V;";Fi") (50) 
n 

> -/(f/";r"|K") (51) 
n 

= ^ [/7(t/f|V;") -/7(f/nv;",Fi")] (52) 

> H{Ui\Vi)-H{U,^\U^) (53) 

> H{Ui\Vi)-6{P:) (54) 



where (|491) follows since f/" X" form a Markov chain, similarly (1501) follows since Vi — > 

F" form a Markov chain, and finally (|54|) follows from Fano's inequality. Similarly, we can 
also show 

h{Xl^-Y^) > H{U2\V2)-5{P:), (55) 

where the joint probability distribution factors as p(x")p(x2). 

From the capacity region given in (gV]), we see that {H{Ui\Vi) - 5{P^), H{U2\V2) - S{P^)) E C 
for all n. Then, since 5{P^) ^ as n ^ oo, and from the compactness of the capacity region, we 
can conclude that P" — > implies that {H {Ui\Vi) , H {U2\V2)) G C. This completes the proof. 

Appendix II 
Proof of Theorem [2] 

We first briefly review the notions of types and strong typicality that will be used in the proof. 
Given a distribution p{x), the type P^n of an n-tuple x" is the empirical distribution 

= iAr(a|x") 
n 

where A^(a|x") is the number of occurrences of the letter a in x". The set of all n-tuples x" with 
type Q is called the type class Q and is denoted by T"-{Q). The set of (5-strongly typical n-tuples 
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according to p{x) is denoted by T"(X) and is defined by 



-N{a\x'') -p{a) 



n 



< 5,\/a E X and N{a\x^) = whenever p(x) = 

The definitions of type and strong typicality can be extended to joint and conditional distributions 
in a similar manner [30]. We have 

^ < S (56) 



-log|T:(X)|-/7(X) 



n 

for sufficiently large n. Given a joint distribution p{x,y), if (x", y") ~ p"{x)p^{y), where and 
are n-fold products of the marginals p{x) and p{y), then 

Pr{(x",?/") G r;'(XF)} < 2-"(^(^;^)-35). (57) 

Now, we start the achievability proof. Fix a joint distribution as in (fT3l) . where Vi), p{u2, V2), 
p{yi,y2\xi,X2) are given while we are free to choose p{q), p{wi, Xi\ui, q) and 
piw2,X2\u2,q). 

Codebook generation: First, generate one random n-sequence in an i.i.d. fashion according to 

p{q)- 

Next, for Transmitter 1, generate a codebook of size Li with ^ logLi > I{Ui; Wi\Q), in which the 
codewords are generated i.i.d. with distribution p{wi\q). This codebook is denoted by C^. 

For each possible source output m", count the number of codewords in that are jointly typical 
with u". If there are at least Li2^"^^^'^'^'^^'^^~'^^'' codewords in jointly typical with u", choose one 
uniformly at random, and call it If there are fewer than L-^^2^i^HUi;Wi\Q)~2n€ codewords of 

jointly typical with u", randomly choose one codeword from to be w'i{ui ). The reason why we 
require the number of codewords jointly typical with m" to be large is to benefit the probability of 
error calculation later on in the proof. In a similar fashion, we generate C^. 

Define F(u1, U2) as the event that the number of G jointly typical with is larger than 
l^^2-'^i(Ui;Wi\Q)-2ne ^j^g numbcr of w'^ G Cl jointly typical with is larger than i,22-'^^(^2;W^2|Q)-2n6_ 

Next, we will show that 

Fr{F%U^,U^)}<3e, (58) 

where "c" denotes the complement. 

For each (g",M",M2) G T"{QUiU2), define the random variable v{i,u1) as follows: is 1 if 

the i-th codeword of is jointly typical with and otherwise. Then, 

2-ni{UrWi\Q)-ne < E[z/(i,0|g"] = Pr{z/(i, <) = l|g"} < 2-"^(^i'^ilQ)+"^ (59) 

V[z/(^,<)|g"] < E2[K^,<)|g1 < E[K^<)] (60) 
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where E and V denote the expectation and variance, respectively. Further define random variable 
N(ui) as the number of codewords in that are jointly typical with m", i.e., 

Li 

iV«) = ^z/(z,<). (61) 



i=l 



Then, from (|59l) and (l60l) . we have 

Li 

j^^^-niiu,;Wi\Q)-ne < E[iV«) = ^ E[z/(i, <) |g"] < Li2-"^(^^'^il<3)+"^ (62) 



V[iVK)|gT = 5^ V[z/(2,<)|g"] < E[iV«)|g"]. (63) 

Hence, we have 

Pr{iV«) <L^2-"^(^i'^il^)-2"^|g"} 

= Pr {E[Ar«)|g"] - iV«) > E[iV«)|g"] - Li2-"^(^1'^iI^)-2"%"} (64) 

< Pr {E[iV«)|g"] - A^«) > Li2-"^(^1'^^I^)-"^ - Li2-"^(^^'^il<3)-2ne|g"} (65) 

< Pr {|E[A^«)|g"] - Ar«)| > Li2-"^(^1'^^I^)-"^ - Li2-"^(^i'^^l^)-2"^|g"} (66) 

< V^^KM (67) 

~ (Li2-"'f(^i;^il<5)-'^<: - Li2-"'^('^i;^il'3)-2ne) 

< Eimm , (68) 

~ (Li2-"'f(^i;^il<5)-'^^ - Li2-"'^(^i;^il<3)-2"^) 

< o (69) 

~ (Li2-"-f(^i;^il<5)-'^^ - Li2-"-f(^i;^ilQ)-2"«^) 

< e (70) 



where (|65l) and (1691 ) follows from (1621) . (1671) follows from Chebyshev's inequality, (|68l) follows from 
(|63l) . and (iTOl) is true when n is large enough. The same analysis applies for 
Hence, we have proved that 

Pr{F"«,0|g"} = Pr{Ar«) < Li2-''^^^''^'\^^-^''' or Niu"^) < L22-'^^^^^■^wm-2ne^qn^^ 

< 2e (71) 
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for all (g",M",M2) ^ T^{QUiU2) and all sufficiently large n. This means that 



Fr{F%U^,U^)}= J2 Pr{F%U^,UmU^,U^,Qn = i 



:<,<,g")} 



•Pr{(t/r,^2",Q") = K,<,g")} 



(72) 



+ 



■Pr{(f/r,t/2",Q") = K,«2,g")} 
5^ Pr{F^(f/r,f/2")|(t/r,[/^,Q") = K,«^,g")} 



ig",u1,ul^)^Tp{QUiU2) 



■Pr{(f/r,f/2",g") = «,«^,g")} 



(73) 



(74) 



<3e 



(75) 



where (1741) follows from (fTTI) . and (fTSl) follows when n is large enough from the asymptotic equipar- 
tition property (AEP) [29]. 

This means that with large probability, the number of sequences jointly typical with f/" and U2 in 
codebooks and Cl are larger than Li2-"^(^i'^ilQ)-2"^ and L22-"^(^2;iy2|Q)-2nE^ respectively. This 
fact will be used in the probability of error calculation. 

Codebook generation: For each possible sequence, generate one sequence in an i.i.d. fashion, 
conditioned on w^{u'^), and g", according to p{xi\ui,wi, q). This sequence is denoted by 
The collection of all sequences will be denoted as the codebook C^. Similarly, we 
generate the codebook C^. 

Encoding: When Transmitter 1 observes the sequence u", it transmits x^{u^,Wi{u'^)). Similarly for 
Transmitter 2. 

Decoding: Receiver 1 finds the unique pair (^",^2)' ^1 ^ ^2 ^ ^'^'^^ that 
Xi{ui,Wi{ui)), W2, yi-,V2) are jointly typical and declares the first component of the pair as the 
transmitted source. If there are more than one pair, and the first component of the pairs are the same, 
then the decoder declares the transmitted source to be the first component. If there are more than one 
pair, and the first component of the pairs are not the same, an error is declared. Also, if no such pair 
exists, an error is declared. Similarly for Receiver 2. 



Probability of error calculation: Denote by E{ui,W2) the event (m", X"('u", w"('u")), 
w^,Y^^,V^) e T^{UiWiXiW2YiV2\q'') for K, ii;^) G x C^. Further denote by G'«,u5) the 
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event {u'l,u^,w'^{u'l),w^{u^)) e T^{UiU2WiW2\q'^). 

Then, the probability of error at Receiver 1 conditioned on Q"' = q"-, denoted by P^, is given by 



<vA E^iU^, w^iU^)) or F%U^, U^) or G^(f/r, U^) or (J E«, wl^) 
<Pt{E%U^,wUU^)) or F%U^,U^) or G''iU^,U^)} 



EnFnG 



<Pr{F"(f/f, + Pr {^^(^yf , U^)\F} + Pr {E'{U'^,w^{U'^))\F n G} 



(76) 



(77) 



(78) 



+ J2 Pr{^«,0|^nFnG} ^ , (79) 

where we have used the short-hand E, F and G to denote events E(U^, 102(1/2)), F(f/", f/^) and 
G{U^,U^), respectively. 

The first term in (|79l) is bounded by 3e as shown by (fTSl) . From the achievability results of multi- 
terminal rate-distortion theory [31], the second term in (1791 ) is bounded by e for sufficiently large n. 
The third term in (|79l ) is bounded by e for sufficiently large n based on the AEP [29]. Hence, from 
now on, we will concentrate on the fourth term in (1791) . 

The fourth term in ([791) may be upper bounded by the sum of the following four terms, which will 
be denoted by Ai, A2, A3, and ^4, respectively: 



^ PT{E{ulw^{umEnFnG} > 



(80) 



A, ^E 



J2 Pr{^K,w^)|^nFnG'} 



(81) 
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J2 Pr{E{u'i,w^{UmEnFnG} > 



and 



First, we upper bound Ai. Define the set 

B, = K e : < ^ f/r,<K) ^ <(f/r), «,<«)) e T:{U,Wr\Y,^V,-w^{U^)qn}. 
Then, we have 

Hence, we may write 



< E||i3i| maxPr{E«,w^(t/2"))|FnFnG'} 

= E||i3i| max Pr{Xf«,<«)) G T;'(Xi|««)w;^(f/2")Fi"V2V)l^ H F n G} 

< eI\B^ I max 2"^(-^il'^i'^i''^2,yi,V'2,Q)+ne2-ni/(Xi|(7i,w/i,Q)+ne|^ R F H gI 
~ \ uieBi I J 

^ 2"'f^(^i)2""^('^i'^i'^i'"^i'^2l^2,Q)+5ne 
<^ 2"'f^('^i)2-"^(^i;^i''^2|^2'<3)+5"<: 



where (|9T| ) follows because the distribution in (fT3] ) satisfies the Markov chain relationship (f/i, W^i) 
(Xi, W2,Q) — ^ (V2, Fi). Next, we upper bound A2. Define the set 



2 



Then, we have 



«,<«), ^2") e T:iU^W^W2\Y{'V,-q-)}. 

E{|i32|} <2"^(^2|yiy2,Q)+ne2~"'^(^2|Q)+riej^^^ _ j^-j 

2nH(Ui\W2,Yiy2,Q)+ne2nH{Wi\Ui,W2,Yi,V2,Q)+ne2-nH{Wi\Ui,Q)+ne 



22 



Similarly to (1861)-(1901), we may write 

"^2 = E< Yl Pr{E{u'l,wJ^)\EnFnG}\ (94) 

where (|96l ) follows from the same reason as ( |9T| ). Next, we upper bound A3. Define the set 

B, = {< : < ^ f/r, <«) = wuu^), < G T:(^iK(^r)ri"\/>2"(t^2 )?")}• (97) 

Then, we have 

Ell^all^nFnG} <2"^(^il^^'^i'^^'^^'«)+"^ ,,,, J,^, , (98) 

which follows from the fact that we always choose randomly from at least 2vi2"^*^^i'^il*^)~2"'' choices 
to get tt;"(M"). Similarly to (I86l)-(|90|). we may write 

As = e\ Pr{E{ulw^iU^))\EnFnG}\ (99) 

nnH{Ui) 

< f 2-nI{Ui,Xi;Yiy2\Wi,W2,Q)+5n€ ^^qq^ 

OnHiUi) 

< f 2-nI{Xi;Yl,V2\Wi,W2,Q)+5nt ^^q^^ 

where (IIOII) follows from the same reason as (|9T1) . Finally, we upper bound A4. Define the set 

K, w^) G T;^(f/iVr2K(f/r)^i"^2V)}- (102) 

Then, we have 

E {|i34||i? n F n G} <2"'f^(^2l^i'^2'^i''3)+"'^2""^(^2l*^)+"'(L2 - 1) 

2nH{Ui\Wi,W2,Yi,V2,Q)+ne ^ (]()^\ 

2-nIiUi;Wi\Q)-2nel^- ^^""-^^ 



Similarly to (I86l)-(l90l). we may write 



A4 = eI Y [E{u'l,w^)\EnFnG]\ (104) 
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where (|106l) follows from the same reason as (|9T1) . 

We have similar probability of error calculations at Receiver 2. Since 

P: < EQn[P^ + P^], (107) 

for this achievability scheme, as long as the following equations are satisfied, 

H{U^)<I{X^■,V2,Y^\W2,Q), (108) 

H{Ui) - log Li < /(Xi; V2, Filial, W2, Q), (109) 

H{U^) + log L2 < I{W2,X,-V2,Y^\Q), 

H{Ui) + \ogL2 - logL^ < I{W2,X,;V2,Y,\WuQ), (HO) 

H{U2)<I{X2;VuY2\Wi,Q), 

H{U2) - logL2 < I{X2;VuY2\W,,W2,Q), (111) 

H{U2) + log Li < I{Wi, X2; Vi, Y2\Q), 

H{U2) + logLi - log L2 < I{Wi, X2; Vi, Y2\W2, Q), (112) 

logLi > I{Ui; Wi\Q) and (113) 

logL2>I{U2;W2\Q), (114) 

for some p{q), p{wi, xi\ui, q), and p{w2^ X2\u2, q), the probability of error is arbitrarily small for 
sufficiently large n. 

By Fourier-Motzkin elimination, we obtain the sufficient conditions given in Theorem [2l 

Appendix III 
Proof of Lemma [H 

We first start with the proof of achievability. Fix distributions p{sis), p{xi\sis), p{s2s) and p{x2\s2s)- 
For codebook at Transmitter A;, A; = 1, 2, we generate an inner codebook of 2^^*== i.i.d. codewords of 
length with probability Y[iLiPi^ks,i)- Then, for each codeword of the inner codebook, we generate 
an outer codebook of 2^^*=^ i.i.d. codewords of length with probability Y[iLiPi^k,i\sks,i)- For 
Wks = Wks and W^p = Wkp, Transmitter k sends the w^p-th codeword of the w^s-th outer codebook. 
For decoding. Receiver 1 finds the codeword in all possible outer codebooks that is jointly typical 
with the received sequence and the W2s-th codeword of the inner codebook of Transmitter 2. Similarly 
for Receiver 2. The probability of error analysis follows from standard arguments [29], and we can 



24 



show that the probability of error can be driven to zero as A?^ — > oo, as long as the rates satisfy the 
following conditions: 

Rip<I{X,;Y,\Su,S2s), (115) 
Ru + Rip<I{X,;Y,\S2s), (116) 
R2p<IiX2;Y2\Su,S2s) and (117) 

R2s + R2p<I{X2;Y2\Sls)- (118) 

For each n, similarly to [1, Theorem 5], by treating the interference channel 1/2 , a^g), 

which is a product channel of p{yi,y2\xi, X2), as a memory less channel, we conclude that the rates 
satisfying the following conditions are achievable for any n: 

Rip<-I{X^;Y,-\Sl,Sl), (119) 

Th 

Ris + Rip < -l{x^; Yr\Sl), (120) 

Th 

R2p < -i{x^; ITI-SL, SI) and (121) 

Th 

R2s + R2p<lliX'2;Y^'\Sis), (122) 

i.e., any rate quadruplet {Ru, Rip, R2S: R2p) e is achievable. By the definition of the capacity 
region, the limiting points of are also achievable, and thus, we have proved the achievability of all 
the points in Ci. 

We next prove the converse. For any ^2"^i'', 2"^ip, 2"^^% 2"^^?', n) code, denote its input to the 
channel as random variables X" and X^ and the output of the channel as random variables Y-[\ Y2. 

Arbitrarily choose Mu = 2"^^" n-letter sequences uY,ul^, ■ ■ ■ ,u]^^^ all in A"", and M2S = 2"^2s 
n-letter sequences uf^, • • • , uj^^^ all in X^. Form a one-to-one correspondence between Wu, W2S 
and S'^g, S2S, respectively by 

I 1 if = u}' , wi, = 1, 2, ■ ■ ■ , Mis 
p-{SZ^u^\Wis^wu)^{ (123) 

otherwise 

,1 if li" = uli , = 1, 2, • • • , M2s 

p^{Sl = u''\W2s = W2s)= { (124) 

otherwise 
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By Fano's inequality [29], we have 

nR,p = H{W,p) = H{W,j,\Wu, W^s) (125) 

= I{W,p; Y,^\Wu, W2s) + H{W,p\Y,^, Wu, W,s) (126) 

< I{Wip, Y,"\Wis, W2s) + H{Wip\Y,^, W2s) (127) 

< I{W,p, Y.^lWu, W2s) + n5{P:) (128) 

< /(Xf; Y^Wu, W2s) + n5{P:) (129) 
= /(Xf; Y^\S^^, SI) + n6{P:) (130) 

where 5{x) in (11281 ) is a non-negative function approaching zero as x — > 0, (11291 ) follows from data pro- 
cessing inequality [29] because the distributions factor as p{wip)p{wis)p{x'i\wip,Wis) 
Piw2p)p{w2s)pix2\w2p, W2s)p{yi\x'^, Xg) and satisfy the Markov chain relationship (Wip, Wis) i^i^ W2s 
y™, and (|130l) follows from the definitions of the sequences S^^ and in (11231) and (|124)) . respectively. 
We also have 

nRu + riRip = H{Wu, Wip) = H{Wu, Wip\W2s) (131) 
= I{Wu, Wip- Y^\W2s) + H{Wu, Wip\Y^, W2s) (132) 

< I{Wu, Wip- Y^\W2s) + n5{P:) (133) 

< /(Xf; Y^\W2s) + n5{P:) (134) 
= I{Xl,Y^\Sl)+n5{P:) (135) 

where (11341) follows from the same reason as (11291) . and (|135l) follows from the same reason as (|130l) . 
Similarly, we have 

nR2, < /(X- y^l^i" , ^2" ) + n6{P:) (136) 
ni?2s + nR2p < /(X2"; Fal'^rj + n5(P;). (137) 
Hence, we have proved that for all n, 

{Ru - 5{P:), Rip - 5{P:), R2s - 5{P:), R2p - 5{P:)) e g^. (138) 
Since the region C/ as defined in (1331) contains for every n [1, Theorem 5], we have 

{Ru - S{P:), Rip - 6{P:), R2s - 5{P:), R2p - 5{P:)) G Ci (139) 
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for all n. For codes where Pg" as n — > oo, we have 

{Ris, Rip, R2s, R2p) & Cj (140) 
since Cj is closed [1, Theorem 5]. This concludes the converse part of the proof. 

Appendix IV 
Proof of Theorem [3] 

The achievability part of the proof is straightforward. If (l34l) holds, then there exists a rate quadruplet 
{Ris, Rip, R2s, R2p) in the interior of C such that H{Vk) < Rks and H{Uk\Vk) < Rkp for k = 1,2. 
Transmitter k first compresses Vk into index Wks with rate H{Vk), and then Uk\Vk = Vk into index 
Wfcp(ffc) into rate H{Uk\Vk), for all Vk in the typical set. Then the indices can be transmitted reliably 
over the channel since (Ris, Rip, R2s, R2p) is in the capacity region of the underlying interference 
channel with message side information Wis at Receiver 2 and W2s at Receiver 1. 

To prove the converse, we write 

nH{Ui\Vi) = HiU^lV^") = if(f/f |V;", V^) (141) 

= I{U^; r"|V;", V^) + HiU^'lY^, V;", V7) (142) 

< J(f/f; Fi"|V;", V^^) + H{U'^\Yl', V;') (143) 

< /(f/f; y"|V^", V7) + n5{P^) (144) 

< /(Xf; Y^\V^, V^) + n5{P:^) (145) 

where (|144)) follows from Fano's inequality and 5{x) is a non-negative function approaching zero as 
x ^ 0, and (11451) follows from the data processing inequality, in other words, from the Markov chain 
relationship (f/f , V^) (Xf , V^) ^ We can also write 

nH{Vi) + nH{Ui\Vi) = nH{Ui, Vi) (146) 

= nHiUi) (147) 

= H{U'^) (148) 

= m") (149) 

= I{U{'; + H{U'^\Yl\ V^) (150) 

< /(f/f; Y^\V^) + n5(P;) (151) 

< /(Xf ; ri"|y2") + n5(P;) (152) 
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where (11471) follows because Vi is a deterministic function of Ui, and (11511) follows from Fano's 
inequality, and (11521) follows from the same reasoning as applied to (11451) . Similarly, we have 

nH{U2\V2) < I{X^; l^l^i", ^^2") + ^-^(^D and (153) 

nH(V2) + nH{U2\V2) < I{X^; l^l V^i") + n6{P^). (154) 
Hence, from (I145D . (I152D . (11531) and (11541) . we have 

(/f(\/i) - 5(P;), - (5(P;), //(Vs) - S{P:), H{U2\V2) - 5{P:)) e (155) 

which by the same reasoning as applied to (11391) and (11401) . for codes where Pg" as n — > 00, we 
have 

{H{Vr),H{U^\V^),H{V2).H{U2\V2)) eCi (156) 

which concludes the proof. 

Appendix V 
Proof of Lemma [2] 

Due to the fact that the proof of this lemma is very similar to the proof of the capacity region in 
[13], we omit certain details. For notational convenience, denote the channel of p{yi\xi) as Vi and 
the channel p{y2\xi.,X2) as V2, where 

Vi{a\b) = Pr{Fi = a\Xi = b}, (157) 

and 

V2{c\b, d) = Pr{Y2 = c\Xi = 6, X2 = d}. (158) 

A. Converse Result 

The converse result derived in this subsection is valid for any Z-interference channel satisfying 
Condition 1. The tool that we are using comes from the following lemma. 
Lemma 3: [30, page 314, eqn (3.34)] 

For any n, and any random variables F" and Z"' and W, we have 

n 

i7(Z"|W^)-/7(r"|iy) = Y,iH{Zi\Y'-\ z,+2, ■■■ .z^,w) 

i=l 

- H{Y,\Y'-\ Z,+2, ■ ■ ■ , ^n, W)). (159) 



28 



Since the rate triplets (-Ri^, -Rip, R2p) is achievable, there exist two sequences of codebooks 1 and 2, 
denoted by and , of rate Ria + -Rip and -R2p, and probability of error less than e„, where Pg" — ^ 
as n ^ oo. Let us define X" and be uniformly distributed on codebooks 1 and 2, respectively. 
Let be connected via to X", be connected via to X" and Xj . 

We start the converse with Fano's inequality [29], 

ni?ip = H{Wip) (160) 

<I{Wip-Y^)+n6{P:) (161) 

</(l^ip;yi"|iyi,)+n5(P;) (162) 

= H{Y^\W^,) - H{Y^\Wis, lVip,Xr) + n6{P2) (163) 

= i/(Fi"|iyi,) - H{Y^\X'l) + n5(P;) (164) 

n 

= H{Y^\Wu) -Y,H{Yv;\Xu)+n5{P:) (165) 

i=l 

where (|162l) follows from the fact that VTi^ and Wip are independent, (11631) follows from the fact that 
without loss of generality, we may consider deterministic encoders, (11641 ) follows from the Markov 
chain relationship {Wu-, Wip) X" — > F", and (11651) follows from the memoryless nature of V^". 
We also have 

nPi, + nPip = HiW^p, Wis) (166) 
< J(l^ip, Wu. n") + n5{P:) (167) 
< /(Xf; Fi") + n5(P;) (168) 

n 

<Y,I{Xu\Yu)+n5{P:) (169) 

i=l 

where (11681) follows from the data processing inequality [29]. Furthermore, we have 

nPsp = H{W2p) = H{W2p\Wu) (170) 

<I{W2p-^Y:,-\Wu)+n5{P:) (171) 

<I{Xl,Y^\Wis)+n5{P:) {111) 

= H{Y:^\Wu) - H{Y,^\Xl Wu) + n6{P:) (173) 

n 

< H{Y2i) - HiY^'lXl Wu) + n5{P:) (174) 

i=l 

<nT- H{Y^^\Xl Wu) + n5{P:) (175) 
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where (11701) follows from the independence of W2p and Wis, (|172|) follows from the Markov chain 
relationship (X^, Wu) — * ^2"' (|174l) follows from the fact that conditioning reduces entropy, 

and (11751 ) follows from the definition of r in (|35l) . 
Let us define another channel, V2 : Xi as 

V^2(tkl) = V2(tkl,X2), (176) 

where X2 is an arbitrary element in X2. Further, let us define another sequence of random variables, 
T", which is connected via Vg", the memoryless channel V2 used n times, to X", i.e., Tj Xu 
T{j}c, Xi{j}c, X2 , F", ¥2- Also define as the n-sequence with X2 repeated n times. It is easy to see 
that 



H{Y2-\X2 , Wis) = E E ^^^(^2-1^2 = Wis = w) (177) 

= E ^^(^2"l^2 = ^2, W^ls = W) (178) 
«)=1 
2"^is 

= Y.^Hin^is = w) (179) 



?i)=i 

= if(T"|W^i,) (180) 

where (11781) follows from the fact that the channel under consideration satisfies condition [H and (11791) 
follows from the definition of T". 
By applying Lemma [3l we have 

n 

H{T"]Wis)-H{Y,^\Wis) = Y,Hm\YI-\T,+i,T,+2,--- ,T^,Wis) 

i=l 

- H{Yu\Yr\T,+i,T,+2, - ■ ■ ,T,„Wis). (181) 
Furthermore, since conditioning reduces entropy, we can write 

n n 

H{Y(^\Wis) = J2H{Yu\Yr\Wis) > Y,H{Yu\Yr\T,+i,T,+2,--- ,T^,Wis). (182) 

i=l i=l 

Define the following auxiliary random variables, 

W, = Yl-\ T,+i, T,+2, ■ ■ ■ , T„, W^i,, ^ = 1, 2, ■ ■ ■ , n. (183) 

Further define Q as a random variable that is uniform on the set {1,2, ■ ■ ■ ,n} and independent of 
everything else. Also, define the following auxiliary random variables: 

W = {Wq,Q), Xi=XiQ, Yi = YiQmd T = Tq. (184) 
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Then, from (fTSTl) and (fT82l) . we have 

^-1 _ H{Y^\Wu)) = H{T\W) - H{Yi\W) and (185) 

n~^H{Y;'\Ws) > H{Yi\W). (186) 

Due to the memory less nature of and Kg"' the fact that Q is independent of everything else, and 
the Markov chain relationship Tj Xu — > Yu, for i = 1, 2, ■ ■ ■ , n, the joint distribution of W, Xi, 
Yi, T satisfies 

p{w,xi,yi,t) = p{w)p{xi\w)Vi{yi\xi)V2(t\xi,X2). (187) 
From (|185l) and (11861) . we may conclude that there exists a number 7 > such that 

-HiT'^lWu) = H{T\W) + 7, -HiY^'lWu) = H{Yi\W) + 7. (188) 

n n 

By combining (|165l) . (11691 ), (11751) . (11801) . (|187l) . and (11881) . and allowing n ^ 00, we obtain the 
following converse result: for any Z-interference channel that satisfies Condition 1 and the case where 
Receiver 2 has side information Wis, the achievable rate triplets {Ru, Rip, R2p) must satisfy 

Rip < H{Yi\W) + 7 - H{Yi\Xi), (189) 

Ris + Rip < I{Xi; Yi) and (190) 

R2p<r-H{T\W)--i, (191) 

for some number 7 > and distribution p{w)p{xi\w), where the mutual informations and entropies 
are evaluated using xi, t) = p{w)p{xi\w)Vi{yi\xi)V2{t\xi,X2). 

B. Achievability Result 

The achievability result derived in this subsection is valid for any Z-interference channel. We design 
a codebook at Transmitter 1 such that the inner codebook carries the side information at the Receiver 2, 
i.e.. Wis, and part of Wip, and the outer codebook carries the remaining part of Wip. More specifically, 
the inner codebook is of rate Ru + 7, and the outer codebook is of rate -Rip — 7. Then, we have the 
achievable rate region as the union over all p{w)p{xi\w)p{x2) of 

Rip < H{Yi\W) + 7 - H{Yi\Xi) (192) 

Ris + Rip<I{Xi-Yi) (193) 

R2p<I{X2]Y2\W) anA (194) 

R2p<I{W,X2;Y2)-i, (195) 
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where the mutual informations are evaluated using 

p{w, xi, X2, Vi, 2/2) = p{w)pixi\w)p{x2)Vi{yi\xi)V2iy2\xi, X2). 

C. Capacity Region 

Making use of Conditions 1 and 2 in the exact same way as in [13, Section V], we can show 
that the converse result in (|189l) - (|191| ) and the achievability result in (|192|) - (|195l ) are the same for 
Z-interference channels satisfying Conditions 1 and 2, and hence the capacity region Cj in this case 
is given in Lemma |2l 
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